Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 284104 |
| Missing cells | 44652 |
| Missing cells (%) | 1.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 30.3 MiB |
| Average record size in memory | 112.0 B |
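The overview figures above can be recomputed from a DataFrame with plain pandas. The following is a minimal sketch, not the profiler's own implementation: the helper name `dataset_overview` and the toy frame are illustrative assumptions, and the memory figure may differ from the report depending on how object columns are measured.

```python
import pandas as pd


def dataset_overview(df: pd.DataFrame) -> dict:
    """Recompute report-style overview statistics (hypothetical helper)."""
    n_rows, n_cols = df.shape
    missing_cells = int(df.isna().sum().sum())
    return {
        "variables": n_cols,
        "observations": n_rows,
        "missing_cells": missing_cells,
        # Share of missing cells over all cells (rows x columns).
        "missing_cells_pct": 100 * missing_cells / (n_rows * n_cols),
        # Rows that repeat an earlier row exactly.
        "duplicate_rows": int(df.duplicated().sum()),
        # deep=True also counts the contents of object (string) columns.
        "memory_bytes": int(df.memory_usage(deep=True).sum()),
    }


# Toy data, not taken from the profiled dataset.
toy = pd.DataFrame({"a": [1, 2, 2, None], "b": ["x", "y", "y", "z"]})
overview = dataset_overview(toy)

# Cross-check of the report's own figures: 44652 missing cells
# out of 284104 observations x 14 variables.
report_missing_pct = 100 * 44652 / (284104 * 14)
```

Rounded to one decimal, `report_missing_pct` agrees with the 1.1% shown in the table.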
Variable types
| Categorical | 4 |
|---|---|
| DateTime | 1 |
| Numeric | 9 |
VERSIE has constant value "1.0" | Constant |
DATUM_BESTAND has constant value "2021-10-18" | Constant |
PEILDATUM has constant value "2021-10-01" | Constant |
TYPERENDE_DIAGNOSE_CD has a high cardinality: 1770 distinct values | High cardinality |
BEHANDELEND_SPECIALISME_CD is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
AANTAL_PAT_PER_ZPD is highly correlated with AANTAL_SUBTRAJECT_PER_ZPD | High correlation |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD | High correlation |
AANTAL_PAT_PER_DIAG is highly correlated with AANTAL_SUBTRAJECT_PER_DIAG | High correlation |
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with BEHANDELEND_SPECIALISME_CD and 1 other fields | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with AANTAL_SUBTRAJECT_PER_SPC | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC | High correlation |
VERSIE is highly correlated with DATUM_BESTAND and 1 other fields | High correlation |
DATUM_BESTAND is highly correlated with VERSIE and 1 other fields | High correlation |
PEILDATUM is highly correlated with VERSIE and 1 other fields | High correlation |
JAAR is highly correlated with AANTAL_PAT_PER_SPC and 1 other fields | High correlation |
AANTAL_PAT_PER_SPC is highly correlated with JAAR and 1 other fields | High correlation |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with JAAR and 1 other fields | High correlation |
GEMIDDELDE_VERKOOPPRIJS has 44652 (15.7%) missing values | Missing |
AANTAL_SUBTRAJECT_PER_ZPD is highly skewed (γ1 = 21.38670207) | Skewed |
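The "Constant" and "Missing" alerts above rest on simple per-column counts. A minimal sketch of those counts follows; the helper name `column_flags` and the toy column `PRICE` are assumptions for illustration, and the thresholds the profiler uses for the other alert types are not reproduced here.

```python
import pandas as pd


def column_flags(df: pd.DataFrame) -> pd.DataFrame:
    """Per-column distinct and missing counts (hypothetical helper)."""
    rows = []
    for name in df.columns:
        col = df[name]
        n_distinct = int(col.nunique(dropna=True))
        n_missing = int(col.isna().sum())
        rows.append(
            {
                "column": name,
                "distinct": n_distinct,
                # A column with exactly one distinct value is constant.
                "constant": n_distinct == 1,
                "missing": n_missing,
                "missing_pct": 100 * n_missing / len(col),
            }
        )
    return pd.DataFrame(rows).set_index("column")


# Toy data, not taken from the profiled dataset.
toy = pd.DataFrame(
    {
        "VERSIE": ["1.0", "1.0", "1.0", "1.0"],
        "PRICE": [430.0, None, 575.0, 835.0],
    }
)
flags = column_flags(toy)

# Cross-check of the reported share for GEMIDDELDE_VERKOOPPRIJS.
report_missing_pct = 100 * 44652 / 284104
```

Rounded to one decimal, `report_missing_pct` matches the 15.7% in the alert.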
Reproduction
| Analysis started | 2021-11-03 21:49:06.848770 |
|---|---|
| Analysis finished | 2021-11-03 21:49:31.660005 |
| Duration | 24.81 seconds |
| Software version | pandas-profiling v3.1.1 |
VERSIE
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 852312 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 284104 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 284104 | |
| . | 284104 | |
| 0 | 284104 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 568208 | |
| Other Punctuation | 284104 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 284104 | |
| 0 | 284104 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 284104 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 852312 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 284104 | |
| . | 284104 | |
| 0 | 284104 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 852312 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 284104 | |
| . | 284104 | |
| 0 | 284104 |
DATUM_BESTAND
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 2021-10-18 |
|---|
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2841040 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021-10-18 |
|---|---|
| 2nd row | 2021-10-18 |
| 3rd row | 2021-10-18 |
| 4th row | 2021-10-18 |
| 5th row | 2021-10-18 |
Common Values
| Value | Count | Frequency (%) |
| 2021-10-18 | 284104 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 852312 | |
| 2 | 568208 | |
| 0 | 568208 | |
| - | 568208 | |
| 8 | 284104 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2272832 | |
| Dash Punctuation | 568208 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 852312 | |
| 2 | 568208 | |
| 0 | 568208 | |
| 8 | 284104 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 568208 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2841040 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 852312 | |
| 2 | 568208 | |
| 0 | 568208 | |
| - | 568208 | |
| 8 | 284104 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2841040 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 852312 | |
| 2 | 568208 | |
| 0 | 568208 | |
| - | 568208 | |
| 8 | 284104 | 10.0% |
PEILDATUM
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 2021-10-01 |
|---|
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2841040 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021-10-01 |
|---|---|
| 2nd row | 2021-10-01 |
| 3rd row | 2021-10-01 |
| 4th row | 2021-10-01 |
| 5th row | 2021-10-01 |
Common Values
| Value | Count | Frequency (%) |
| 2021-10-01 | 284104 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 852312 | |
| 1 | 852312 | |
| 2 | 568208 | |
| - | 568208 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2272832 | |
| Dash Punctuation | 568208 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 852312 | |
| 1 | 852312 | |
| 2 | 568208 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 568208 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2841040 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 852312 | |
| 1 | 852312 | |
| 2 | 568208 | |
| - | 568208 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2841040 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 852312 | |
| 1 | 852312 | |
| 2 | 568208 | |
| - | 568208 |
JAAR
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| Minimum | 2012-01-01 00:00:00 |
|---|---|
| Maximum | 2021-01-01 00:00:00 |
BEHANDELEND_SPECIALISME_CD
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 424.1597936 |
| Minimum | 301 |
|---|---|
| Maximum | 8418 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 301 |
|---|---|
| 5-th percentile | 302 |
| Q1 | 305 |
| median | 313 |
| Q3 | 322 |
| 95-th percentile | 335 |
| Maximum | 8418 |
| Range | 8117 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 931.4989193 |
|---|---|
| Coefficient of variation (CV) | 2.196103764 |
| Kurtosis | 69.51143197 |
| Mean | 424.1597936 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 8.449858075 |
| Sum | 120505494 |
| Variance | 867690.2367 |
| Monotonicity | Not monotonic |
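Several of the derived statistics above follow directly from the basic ones: range is maximum minus minimum, IQR is Q3 minus Q1, and CV is the standard deviation divided by the mean. The sketch below recomputes them with pandas on toy data. The helper name `describe_numeric` is an assumption, and it assumes the sample standard deviation and the pandas estimators for skewness and excess kurtosis, which may not match the profiler in every detail.

```python
import pandas as pd


def describe_numeric(values) -> dict:
    """Derived descriptive statistics for one numeric column (hypothetical helper)."""
    s = pd.Series(values, dtype="float64").dropna()
    q1, median, q3 = s.quantile([0.25, 0.5, 0.75])
    mean = s.mean()
    std = s.std()  # sample standard deviation (ddof=1)
    return {
        "range": s.max() - s.min(),
        "iqr": q3 - q1,
        "cv": std / mean,
        # Median absolute deviation around the median.
        "mad": (s - median).abs().median(),
        "skewness": s.skew(),
        "kurtosis": s.kurt(),  # excess kurtosis
    }


# Toy data with one large outlier, not taken from the profiled dataset.
stats = describe_numeric([1, 2, 3, 4, 100])

# Cross-checks against the values reported in the tables above.
report_cv = 931.4989193 / 424.1597936  # standard deviation / mean
report_range = 8418 - 301              # maximum - minimum
report_iqr = 322 - 305                 # Q3 - Q1
```

The three cross-checks reproduce the reported CV (about 2.196), range (8117), and IQR (17).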
| Value | Count | Frequency (%) |
| 305 | 40296 | |
| 313 | 36877 | |
| 303 | 32738 | |
| 330 | 22673 | 8.0% |
| 316 | 19328 | 6.8% |
| 308 | 14804 | 5.2% |
| 306 | 11837 | 4.2% |
| 324 | 11824 | 4.2% |
| 301 | 11518 | 4.1% |
| 304 | 9303 | 3.3% |
| Other values (17) | 72906 |
| Value | Count | Frequency (%) |
| 301 | 11518 | 4.1% |
| 302 | 6212 | 2.2% |
| 303 | 32738 | |
| 304 | 9303 | 3.3% |
| 305 | 40296 | |
| 306 | 11837 | 4.2% |
| 307 | 4935 | 1.7% |
| 308 | 14804 | 5.2% |
| 310 | 3176 | 1.1% |
| 313 | 36877 |
| Value | Count | Frequency (%) |
| 8418 | 3798 | 1.3% |
| 1900 | 186 | 0.1% |
| 390 | 759 | 0.3% |
| 389 | 3054 | 1.1% |
| 362 | 4034 | 1.4% |
| 361 | 2018 | 0.7% |
| 335 | 2917 | 1.0% |
| 330 | 22673 | |
| 329 | 751 | 0.3% |
| 328 | 6054 | 2.1% |
TYPERENDE_DIAGNOSE_CD
| Distinct | 1770 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 101 | 1201 |
|---|---|
| 402 | 1172 |
| 403 | 1141 |
| 301 | 1133 |
| 203 | 1073 |
| Other values (1765) |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.349329823 |
| Min length | 2 |
Characters and Unicode
| Total characters | 951558 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 404 |
|---|---|
| 2nd row | 112 |
| 3rd row | 707 |
| 4th row | 111 |
| 5th row | 110 |
Common Values
| Value | Count | Frequency (%) |
| 101 | 1201 | 0.4% |
| 402 | 1172 | 0.4% |
| 403 | 1141 | 0.4% |
| 301 | 1133 | 0.4% |
| 203 | 1073 | 0.4% |
| 201 | 1069 | 0.4% |
| 401 | 953 | 0.3% |
| 404 | 951 | 0.3% |
| 802 | 929 | 0.3% |
| 409 | 926 | 0.3% |
| Other values (1760) | 273556 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 182186 | |
| 0 | 173922 | |
| 2 | 126036 | |
| 3 | 103337 | |
| 5 | 73119 | |
| 9 | 68861 | 7.2% |
| 4 | 67806 | 7.1% |
| 7 | 56054 | 5.9% |
| 6 | 49706 | 5.2% |
| 8 | 40880 | 4.3% |
| Other values (15) | 9651 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 941907 | |
| Uppercase Letter | 9651 | 1.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1807 | |
| M | 1602 | |
| B | 1158 | |
| E | 827 | |
| Z | 777 | |
| D | 656 | 6.8% |
| A | 631 | 6.5% |
| F | 611 | 6.3% |
| C | 320 | 3.3% |
| K | 310 | 3.2% |
| Other values (5) | 952 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 182186 | |
| 0 | 173922 | |
| 2 | 126036 | |
| 3 | 103337 | |
| 5 | 73119 | |
| 9 | 68861 | 7.3% |
| 4 | 67806 | 7.2% |
| 7 | 56054 | 6.0% |
| 6 | 49706 | 5.3% |
| 8 | 40880 | 4.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 941907 | |
| Latin | 9651 | 1.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 1807 | |
| M | 1602 | |
| B | 1158 | |
| E | 827 | |
| Z | 777 | |
| D | 656 | 6.8% |
| A | 631 | 6.5% |
| F | 611 | 6.3% |
| C | 320 | 3.3% |
| K | 310 | 3.2% |
| Other values (5) | 952 |
Common
| Value | Count | Frequency (%) |
| 1 | 182186 | |
| 0 | 173922 | |
| 2 | 126036 | |
| 3 | 103337 | |
| 5 | 73119 | |
| 9 | 68861 | 7.3% |
| 4 | 67806 | 7.2% |
| 7 | 56054 | 6.0% |
| 6 | 49706 | 5.3% |
| 8 | 40880 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 951558 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 182186 | |
| 0 | 173922 | |
| 2 | 126036 | |
| 3 | 103337 | |
| 5 | 73119 | |
| 9 | 68861 | 7.2% |
| 4 | 67806 | 7.1% |
| 7 | 56054 | 5.9% |
| 6 | 49706 | 5.2% |
| 8 | 40880 | 4.3% |
| Other values (15) | 9651 | 1.0% |
ZORGPRODUCT_CD
Real number (ℝ≥0)
| Distinct | 5937 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 439888418.7 |
| Minimum | 10501002 |
|---|---|
| Maximum | 998418081 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 10501002 |
|---|---|
| 5-th percentile | 28999037 |
| Q1 | 99799028 |
| median | 149599021.5 |
| Q3 | 990004004 |
| 95-th percentile | 990516014 |
| Maximum | 998418081 |
| Range | 987917079 |
| Interquartile range (IQR) | 890204976 |
Descriptive statistics
| Standard deviation | 428883557.2 |
|---|---|
| Coefficient of variation (CV) | 0.974982607 |
| Kurtosis | -1.733266876 |
| Mean | 439888418.7 |
| Median Absolute Deviation (MAD) | 119600015.5 |
| Skewness | 0.4718533774 |
| Sum | 1.249740593 × 10^14 |
| Variance | 1.839411056 × 10^17 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 990004009 | 2102 | 0.7% |
| 990004007 | 2060 | 0.7% |
| 990003004 | 2000 | 0.7% |
| 990004006 | 1659 | 0.6% |
| 990356076 | 1499 | 0.5% |
| 990356073 | 1372 | 0.5% |
| 990003007 | 1300 | 0.5% |
| 131999228 | 1271 | 0.4% |
| 131999164 | 1253 | 0.4% |
| 199299013 | 1196 | 0.4% |
| Other values (5927) | 268392 |
| Value | Count | Frequency (%) |
| 10501002 | 7 | |
| 10501003 | 10 | |
| 10501004 | 10 | |
| 10501005 | 10 | |
| 10501007 | 3 | < 0.1% |
| 10501008 | 10 | |
| 10501010 | 10 | |
| 10501011 | 3 | < 0.1% |
| 11101002 | 9 | |
| 11101003 | 10 |
| Value | Count | Frequency (%) |
| 998418081 | 136 | |
| 998418080 | 122 | |
| 998418079 | 35 | < 0.1% |
| 998418077 | 7 | < 0.1% |
| 998418076 | 7 | < 0.1% |
| 998418075 | 6 | < 0.1% |
| 998418074 | 188 | |
| 998418073 | 187 | |
| 998418072 | 7 | < 0.1% |
| 998418071 | 7 | < 0.1% |
AANTAL_PAT_PER_ZPD
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 9379 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 505.127133 |
| Minimum | 1 |
|---|---|
| Maximum | 164407 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 13 |
| Q3 | 101 |
| 95-th percentile | 1703 |
| Maximum | 164407 |
| Range | 164406 |
| Interquartile range (IQR) | 98 |
Descriptive statistics
| Standard deviation | 3144.476564 |
|---|---|
| Coefficient of variation (CV) | 6.2251191 |
| Kurtosis | 406.3582677 |
| Mean | 505.127133 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 16.75970234 |
| Sum | 143508639 |
| Variance | 9887732.859 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 47157 | 16.6% |
| 2 | 23123 | 8.1% |
| 3 | 15083 | 5.3% |
| 4 | 11141 | 3.9% |
| 5 | 8630 | 3.0% |
| 6 | 7271 | 2.6% |
| 7 | 6054 | 2.1% |
| 8 | 5113 | 1.8% |
| 9 | 4707 | 1.7% |
| 10 | 4165 | 1.5% |
| Other values (9369) | 151660 |
| Value | Count | Frequency (%) |
| 1 | 47157 | |
| 2 | 23123 | |
| 3 | 15083 | 5.3% |
| 4 | 11141 | 3.9% |
| 5 | 8630 | 3.0% |
| 6 | 7271 | 2.6% |
| 7 | 6054 | 2.1% |
| 8 | 5113 | 1.8% |
| 9 | 4707 | 1.7% |
| 10 | 4165 | 1.5% |
| Value | Count | Frequency (%) |
| 164407 | 1 | |
| 155871 | 1 | |
| 154272 | 1 | |
| 147961 | 1 | |
| 144726 | 1 | |
| 117383 | 1 | |
| 115606 | 1 | |
| 110208 | 1 | |
| 109677 | 1 | |
| 108959 | 1 |
AANTAL_SUBTRAJECT_PER_ZPD
Real number (ℝ≥0)
HIGH CORRELATION, SKEWED
| Distinct | 10080 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 593.6964386 |
| Minimum | 1 |
|---|---|
| Maximum | 239907 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 14 |
| Q3 | 110 |
| 95-th percentile | 1932.85 |
| Maximum | 239907 |
| Range | 239906 |
| Interquartile range (IQR) | 107 |
Descriptive statistics
| Standard deviation | 4014.367424 |
|---|---|
| Coefficient of variation (CV) | 6.761649831 |
| Kurtosis | 729.4669142 |
| Mean | 593.6964386 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 21.38670207 |
| Sum | 168671533 |
| Variance | 16115145.81 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 45470 | 16.0% |
| 2 | 22731 | 8.0% |
| 3 | 14947 | 5.3% |
| 4 | 10939 | 3.9% |
| 5 | 8566 | 3.0% |
| 6 | 7242 | 2.5% |
| 7 | 6020 | 2.1% |
| 8 | 5048 | 1.8% |
| 9 | 4688 | 1.7% |
| 10 | 4131 | 1.5% |
| Other values (10070) | 154322 |
| Value | Count | Frequency (%) |
| 1 | 45470 | |
| 2 | 22731 | |
| 3 | 14947 | 5.3% |
| 4 | 10939 | 3.9% |
| 5 | 8566 | 3.0% |
| 6 | 7242 | 2.5% |
| 7 | 6020 | 2.1% |
| 8 | 5048 | 1.8% |
| 9 | 4688 | 1.7% |
| 10 | 4131 | 1.5% |
| Value | Count | Frequency (%) |
| 239907 | 1 | |
| 232484 | 1 | |
| 232177 | 1 | |
| 228146 | 1 | |
| 227658 | 1 | |
| 223836 | 1 | |
| 221165 | 1 | |
| 218623 | 1 | |
| 213790 | 1 | |
| 204749 | 1 |
AANTAL_PAT_PER_DIAG
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 8318 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7590.617373 |
| Minimum | 1 |
|---|---|
| Maximum | 227300 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 40 |
| Q1 | 390 |
| median | 1676 |
| Q3 | 6219 |
| 95-th percentile | 36289 |
| Maximum | 227300 |
| Range | 227299 |
| Interquartile range (IQR) | 5829 |
Descriptive statistics
| Standard deviation | 17745.88311 |
|---|---|
| Coefficient of variation (CV) | 2.337870852 |
| Kurtosis | 34.1989451 |
| Mean | 7590.617373 |
| Median Absolute Deviation (MAD) | 1529 |
| Skewness | 5.085166272 |
| Sum | 2156524758 |
| Variance | 314916367.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21 | 450 | 0.2% |
| 9 | 437 | 0.2% |
| 8 | 425 | 0.1% |
| 19 | 420 | 0.1% |
| 25 | 412 | 0.1% |
| 37 | 410 | 0.1% |
| 28 | 407 | 0.1% |
| 12 | 400 | 0.1% |
| 14 | 400 | 0.1% |
| 6 | 396 | 0.1% |
| Other values (8308) | 279947 |
| Value | Count | Frequency (%) |
| 1 | 340 | |
| 2 | 357 | |
| 3 | 356 | |
| 4 | 383 | |
| 5 | 360 | |
| 6 | 396 | |
| 7 | 352 | |
| 8 | 425 | |
| 9 | 437 | |
| 10 | 329 |
| Value | Count | Frequency (%) |
| 227300 | 23 | |
| 213510 | 25 | |
| 212640 | 17 | |
| 210805 | 17 | |
| 210440 | 19 | |
| 209671 | 24 | |
| 204672 | 17 | |
| 200178 | 16 | |
| 198533 | 20 | |
| 189111 | 19 |
AANTAL_SUBTRAJECT_PER_DIAG
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 9177 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10828.85059 |
| Minimum | 1 |
|---|---|
| Maximum | 367763 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 510 |
| median | 2301 |
| Q3 | 8851 |
| 95-th percentile | 51242 |
| Maximum | 367763 |
| Range | 367762 |
| Interquartile range (IQR) | 8341 |
Descriptive statistics
| Standard deviation | 26196.719 |
|---|---|
| Coefficient of variation (CV) | 2.419159706 |
| Kurtosis | 38.24225139 |
| Mean | 10828.85059 |
| Median Absolute Deviation (MAD) | 2114 |
| Skewness | 5.352422043 |
| Sum | 3076519767 |
| Variance | 686268086.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 350 | 0.1% |
| 17 | 349 | 0.1% |
| 34 | 345 | 0.1% |
| 31 | 336 | 0.1% |
| 38 | 334 | 0.1% |
| 6 | 333 | 0.1% |
| 10 | 333 | 0.1% |
| 13 | 333 | 0.1% |
| 4 | 331 | 0.1% |
| 46 | 328 | 0.1% |
| Other values (9167) | 280732 |
| Value | Count | Frequency (%) |
| 1 | 279 | |
| 2 | 293 | |
| 3 | 302 | |
| 4 | 331 | |
| 5 | 303 | |
| 6 | 333 | |
| 7 | 317 | |
| 8 | 283 | |
| 9 | 253 | |
| 10 | 333 |
| Value | Count | Frequency (%) |
| 367763 | 23 | |
| 348460 | 25 | |
| 341708 | 19 | |
| 327682 | 24 | |
| 323799 | 20 | |
| 312879 | 17 | |
| 309714 | 17 | |
| 297716 | 17 | |
| 288415 | 16 | |
| 267042 | 19 |
AANTAL_PAT_PER_SPC
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 269 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 663146.4615 |
| Minimum | 458 |
|---|---|
| Maximum | 1489502 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 458 |
|---|---|
| 5-th percentile | 43282 |
| Q1 | 254267 |
| median | 733987 |
| Q3 | 1005769 |
| 95-th percentile | 1333993 |
| Maximum | 1489502 |
| Range | 1489044 |
| Interquartile range (IQR) | 751502 |
Descriptive statistics
| Standard deviation | 423139.1409 |
|---|---|
| Coefficient of variation (CV) | 0.6380779593 |
| Kurtosis | -1.180388968 |
| Mean | 663146.4615 |
| Median Absolute Deviation (MAD) | 340129 |
| Skewness | 0.04390350245 |
| Sum | 1.884025623 × 10^11 |
| Variance | 1.790467326 × 10^11 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 880969 | 5102 | 1.8% |
| 874284 | 4354 | 1.5% |
| 843990 | 4348 | 1.5% |
| 894410 | 4333 | 1.5% |
| 880569 | 4273 | 1.5% |
| 893071 | 4210 | 1.5% |
| 733987 | 4048 | 1.4% |
| 1084173 | 3890 | 1.4% |
| 1099752 | 3862 | 1.4% |
| 1063681 | 3851 | 1.4% |
| Other values (259) | 241833 |
| Value | Count | Frequency (%) |
| 458 | 60 | < 0.1% |
| 1562 | 125 | < 0.1% |
| 1610 | 130 | < 0.1% |
| 1923 | 131 | < 0.1% |
| 2497 | 173 | |
| 3187 | 239 | |
| 3675 | 67 | < 0.1% |
| 5006 | 81 | < 0.1% |
| 6811 | 380 | |
| 7049 | 331 |
| Value | Count | Frequency (%) |
| 1489502 | 2976 | |
| 1450621 | 3054 | |
| 1421847 | 3564 | |
| 1345229 | 3543 | |
| 1333993 | 3436 | |
| 1332881 | 3546 | |
| 1317377 | 3463 | |
| 1296722 | 1181 | 0.4% |
| 1283082 | 3577 | |
| 1262591 | 1201 | 0.4% |
AANTAL_SUBTRAJECT_PER_SPC
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 269 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1061410.007 |
| Minimum | 480 |
|---|---|
| Maximum | 2660129 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 480 |
|---|---|
| 5-th percentile | 47968 |
| Q1 | 364428 |
| median | 1041762 |
| Q3 | 1729108 |
| 95-th percentile | 2488652 |
| Maximum | 2660129 |
| Range | 2659649 |
| Interquartile range (IQR) | 1364680 |
Descriptive statistics
| Standard deviation | 744078.5251 |
|---|---|
| Coefficient of variation (CV) | 0.7010283682 |
| Kurtosis | -0.8847560124 |
| Mean | 1061410.007 |
| Median Absolute Deviation (MAD) | 687346 |
| Skewness | 0.3513993003 |
| Sum | 3.015508286 × 10^11 |
| Variance | 5.536528515 × 10^11 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1211813 | 5102 | 1.8% |
| 1281747 | 4354 | 1.5% |
| 1216294 | 4348 | 1.5% |
| 1315720 | 4333 | 1.5% |
| 1300621 | 4273 | 1.5% |
| 1332547 | 4210 | 1.5% |
| 1098722 | 4048 | 1.4% |
| 2558008 | 3890 | 1.4% |
| 2660129 | 3862 | 1.4% |
| 2488652 | 3851 | 1.4% |
| Other values (259) | 241833 |
| Value | Count | Frequency (%) |
| 480 | 60 | < 0.1% |
| 1773 | 125 | < 0.1% |
| 1863 | 130 | < 0.1% |
| 2200 | 131 | < 0.1% |
| 2819 | 173 | |
| 3366 | 239 | |
| 3761 | 67 | < 0.1% |
| 5037 | 81 | < 0.1% |
| 7180 | 331 | |
| 7390 | 380 |
| Value | Count | Frequency (%) |
| 2660129 | 3862 | |
| 2603692 | 3845 | |
| 2558008 | 3890 | |
| 2488652 | 3851 | |
| 2481643 | 3726 | |
| 2184417 | 3757 | |
| 2066342 | 3810 | |
| 2035919 | 1169 | 0.4% |
| 1985490 | 1167 | 0.4% |
| 1978552 | 3691 |
GEMIDDELDE_VERKOOPPRIJS
| Distinct | 3295 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 44652 |
| Missing (%) | 15.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3498.83887 |
| Minimum | 0 |
|---|---|
| Maximum | 287220 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 140 |
| Q1 | 460 |
| median | 1215 |
| Q3 | 4015 |
| 95-th percentile | 13275 |
| Maximum | 287220 |
| Range | 287220 |
| Interquartile range (IQR) | 3555 |
Descriptive statistics
| Standard deviation | 6532.291406 |
|---|---|
| Coefficient of variation (CV) | 1.86698835 |
| Kurtosis | 162.8873368 |
| Mean | 3498.83887 |
| Median Absolute Deviation (MAD) | 990 |
| Skewness | 7.658529223 |
| Sum | 837803965 |
| Variance | 42670831.02 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 160 | 1875 | 0.7% |
| 105 | 1859 | 0.7% |
| 110 | 1595 | 0.6% |
| 180 | 1360 | 0.5% |
| 145 | 1359 | 0.5% |
| 300 | 1304 | 0.5% |
| 190 | 1256 | 0.4% |
| 165 | 1223 | 0.4% |
| 140 | 1200 | 0.4% |
| 185 | 1189 | 0.4% |
| Other values (3285) | 225232 | |
| (Missing) | 44652 | 15.7% |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 70 | 226 | 0.1% |
| 75 | 76 | < 0.1% |
| 80 | 362 | 0.1% |
| 85 | 920 | |
| 90 | 602 | 0.2% |
| 95 | 659 | 0.2% |
| 100 | 930 | |
| 105 | 1859 | |
| 110 | 1595 |
| Value | Count | Frequency (%) |
| 287220 | 8 | |
| 148910 | 3 | < 0.1% |
| 142835 | 4 | |
| 122155 | 4 | |
| 116765 | 3 | < 0.1% |
| 109725 | 7 | |
| 108570 | 7 | |
| 107655 | 4 | |
| 101270 | 8 | |
| 95465 | 7 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
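As an illustration of that definition, the sketch below computes ρ as Pearson's r applied to the ranks. The function names are assumptions chosen for this example, ties are handled with average ranks via `pandas.Series.rank`, and the data are toy values rather than columns from the profiled dataset.

```python
import numpy as np
import pandas as pd


def pearson_r(x, y) -> float:
    """Covariance of x and y divided by the product of their standard deviations."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    cov = np.mean((x - x.mean()) * (y - y.mean()))
    return float(cov / (x.std() * y.std()))


def spearman_rho(x, y) -> float:
    """Pearson's r computed on the rank variables of x and y."""
    rank_x = pd.Series(x).rank().to_numpy()  # average ranks for ties
    rank_y = pd.Series(y).rank().to_numpy()
    return pearson_r(rank_x, rank_y)


# A monotonic but nonlinear relationship: y = x ** 3.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)  # ranks agree perfectly
r = pearson_r(x, y)       # below 1 because the relation is not linear
```

On this toy input the ranks of `x` and `y` coincide, so ρ reaches its maximum while r stays below it.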
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
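The formula and the invariance property can be checked numerically. This is a minimal sketch on toy data; the function name `pearson_r` is an assumption for this example, and the scale factors used in the invariance check are positive.

```python
import numpy as np


def pearson_r(x, y) -> float:
    """Covariance of x and y divided by the product of their standard deviations."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    cov = np.mean((x - x.mean()) * (y - y.mean()))
    return float(cov / (x.std() * y.std()))


# Toy data, not taken from the profiled dataset.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 1.0, 4.0, 3.0, 5.0])

r_original = pearson_r(x, y)

# Shift and rescale each variable separately (positive scale factors).
r_transformed = pearson_r(2.0 * x + 5.0, 0.5 * y - 1.0)

# Any increasing linear function of x gives r = 1, whatever its slope.
r_linear = pearson_r(x, 3.0 * x + 2.0)
```

`r_original` and `r_transformed` agree up to floating-point error, which is the invariance described above.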
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
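The pair-counting definition translates directly into code. The sketch below implements exactly the formula stated above, which is often called τ-a; statistical libraries commonly default to a tie-adjusted variant (τ-b), so results can differ when the data contain ties. The function name `kendall_tau_a` is an assumption, and the quadratic loop is meant for illustration, not for a column with hundreds of thousands of rows.

```python
from itertools import combinations


def kendall_tau_a(x, y) -> float:
    """(concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # Tied pairs (direction == 0) count as neither.
    return (concordant - discordant) / len(pairs)


# Toy data: one swapped pair out of six possible pairs.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
```

Here five pairs are concordant and one is discordant, so τ = (5 - 1) / 6.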
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution.
First rows
| | VERSIE | DATUM_BESTAND | PEILDATUM | JAAR | BEHANDELEND_SPECIALISME_CD | TYPERENDE_DIAGNOSE_CD | ZORGPRODUCT_CD | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_ZPD | AANTAL_PAT_PER_DIAG | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_SUBTRAJECT_PER_SPC | GEMIDDELDE_VERKOOPPRIJS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 404 | 131999208 | 1051 | 1053 | 1642 | 1850 | 248748 | 378596 | 430.0 |
| 1 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 112 | 131999118 | 6 | 6 | 1592 | 2798 | 248748 | 378596 | 975.0 |
| 2 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 707 | 131999207 | 80 | 80 | 10458 | 11442 | 248748 | 378596 | 575.0 |
| 3 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 111 | 131999117 | 3 | 3 | 122 | 163 | 248748 | 378596 | 835.0 |
| 4 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 110 | 131999022 | 1 | 1 | 25 | 29 | 248748 | 378596 | 6700.0 |
| 5 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 399 | 131999156 | 163 | 180 | 2118 | 3198 | 248748 | 378596 | 680.0 |
| 6 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 803 | 131999022 | 19 | 19 | 2585 | 2882 | 248748 | 378596 | 6700.0 |
| 7 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 107 | 131999207 | 44 | 47 | 1008 | 1534 | 248748 | 378596 | 575.0 |
| 8 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 304 | 131999207 | 32 | 35 | 745 | 1215 | 248748 | 378596 | 575.0 |
| 9 | 1.0 | 2021-10-18 | 2021-10-01 | 2015-01-01 | 324 | 202 | 131999119 | 1 | 1 | 1198 | 1811 | 248748 | 378596 | 1460.0 |
Last rows
| | VERSIE | DATUM_BESTAND | PEILDATUM | JAAR | BEHANDELEND_SPECIALISME_CD | TYPERENDE_DIAGNOSE_CD | ZORGPRODUCT_CD | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_ZPD | AANTAL_PAT_PER_DIAG | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_SUBTRAJECT_PER_SPC | GEMIDDELDE_VERKOOPPRIJS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 284094 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0413 | 990027168 | 1300 | 1629 | 8996 | 15099 | 200053 | 371329 | 3165.0 |
| 284095 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0512 | 990027209 | 1 | 1 | 2292 | 4421 | 200053 | 371329 | NaN |
| 284096 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0415 | 990027131 | 49 | 49 | 2447 | 4284 | 200053 | 371329 | 165.0 |
| 284097 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0118 | 990027198 | 625 | 887 | 930 | 1533 | 200053 | 371329 | 220.0 |
| 284098 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0212 | 990027198 | 132 | 196 | 170 | 325 | 200053 | 371329 | 220.0 |
| 284099 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0716 | 990027199 | 528 | 570 | 2378 | 3807 | 200053 | 371329 | 845.0 |
| 284100 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0614 | 990027180 | 3 | 3 | 681 | 1067 | 200053 | 371329 | NaN |
| 284101 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0117 | 990027135 | 7 | 7 | 5781 | 9819 | 200053 | 371329 | 40545.0 |
| 284102 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0615 | 990027185 | 17 | 19 | 996 | 1547 | 200053 | 371329 | 14135.0 |
| 284103 | 1.0 | 2021-10-18 | 2021-10-01 | 2018-01-01 | 327 | 0315 | 990027152 | 8 | 8 | 1075 | 2062 | 200053 | 371329 | 74370.0 |